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RAW SEQUENCE LISTING DATE: 12/05/2001 

PATENT APPLICATION: US/09/925,140 TIME: 13:04:59 

- Input Set :. N:\Crf3\RULE60\09925140.txt _ 

Output Set: N:\CRF3\12052001\I925140.raw 

SEQUENCE LISTING 

5 (1) GENERAL INFORMATION:. 

7 (i) APPLICANT: LAL, PREETI 

8 CORLEY, NEIL C. 

9 GUEGLER, KARL J. 
10 PATTERSON, CHANDRA 

12 (ii) TITLE OF INVENTION: SERINE DEHYDRATASE HOMOLOG 

14 (iii) NUMBER OF SEQUENCES: 2 

16 (iv) CORRESPONDENCE ADDRESS: 

17 (A) ADDRESSEE: Incyte Pharmaceuticals, Inc. 

18 (B) STREET: 3174 Porter Drive 

19 (C) CITY: Palo Alto 

20 (D) STATE: CA 

21 (E) COUNTRY: USA 

22 , (F) ZIP: 94304 
24 • (V) COMPUTER READABLE FORM: 
2 5 (A) MEDIUM TYPE: Diskette 
2 6 (B) COMPUTER: IBM Compatible 
2 7 (C) OPERATING SYSTEM: Windows 

28 (D) SOFTWARE: FastSEQ for Windows Version 2.0b 

30 (vi) CURRENT APPLICATION DATA: 

C--> 31 (A) APPLICATION NUMBER: US/09/925,140 

C--> 32 (B) FILING DATE: 08-Aug-2001 

33 (C) CLASSIFICATION: 

35 (vii) PRIOR APPLICATION DATA: 

36 . , (A) APPLICATION NUMBER: 09/088,435 

37 (B) PILING DATE: 1998-06-01 
39 (viii) ATTORNEY/AGENT INFORMATION: 
4 0 (A) NAME: Cerrone, Michael C 
41 (B) REGISTRATION NUMBER: 39,132 
4 2 (C) REFERENCE/DOCKET NUMBER: PF-0512 US 

44 (ix) TELECOMMUNICATION INFORMATION: 

45 (A) TELEPHONE: 650-855-0555 

46 (B) TELEFAX : 650-845-4166 

47 (C) TELEX: 
50 (2) INFORMATION FOR SEQ ID NO : 1 : 

52 (i) SEQUENCE CHARACTERISTICS: 

53 (A) LENGTH: 325 amino acids 

54 (B) TYPE: amino acid 

55 (C) STRANDEDNESS: single 

56 (D) TOPOLOGY: linear 

58 (vii) IMMEDIATE SOURCE: 

59 (A) LIBRARY: THP1AZS08 

60 (B) CLONE: 2752518 

62 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1: 

64 Met Asp Gly Pro Val Ala Glu His Ala Lys Gin Glu Pro Phe His Val 

65 1 5 10 15 
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Output Set: N:\CRF3\12052001\I925140.raw 

6 6 Val Thr Pro Leu Leu Glu Ser Trp Ala Leu Ser Gin Val Ala Gly Met 

67 20 25 . 30 

68 Pro' Val 'Phe Leu Lys ' Cys Glu-Asn Val Gin Pro Ser Gly ^Ser Phe Lys 

69 35 40 45 

70 lie Arg Gly lie Gly His Phe Cys Gin Glu Met Ala Lys Lys Gly Cys 

71 50 55 60 

72 Arg His Leu Val Cys Ser Ser Gly Gly Asn Ala Gly lie Ala Ala Ala 

73 65 70 75 80 

74 Tyr Ala Ala Arg Lys Leu Gly lie Pro Ala Thr lie Val Leu Pro Glu 

75 85 90 95 

76 Ser Thr Ser Leu Gin Val Val Gin Arg Leu Gin Gly Glu Gly Ala Glu 

77 100 ' 105 110 

78 Val Gin Leu Thr Gly Lys Val Trp Asp Glu Ala Asn Leu Arg Ala Gin 

79 115 120 125 

80 Glu Leu Ala Lys Arg Asp Gly Trp Glu Asn Val Pro Pro Phe Asp His 

81 130 * 135 140 

82 Pro Leu lie Trp Lys Gly His Ala Ser Leu Val Gin Glu Leu Lys Ala 

83 145 150 155 160 

84 Val Leu Arg Thr Pro Pro Gly Ala Leu Val Leu Ala Val Gly Gly Gly 

85 165 170 175 

86 Gly Leu Leu Ala Gly Val Val Ala Gly Leu Leu Glu Val Gly. Trp Gin 

87 180 185 190 

88 His Val Pro lie lie Ala Met His Gly Ala His Cys Phe Asn Ala Ala 

89 195 200 205 

90 lie Thr Ala Gly Lys Leu Val Thr Leu Pro Asp lie Thr Ser Val Ala 

91 210 215 V. . 220 

92 Lys Ser Leu Gly Ala Lys Thr Val Ala Ala Arg Ala Leu Glu Cys Met 

93 225 230 235 240 

94 Gin Val Cys Lys lie His Ser Glu Val Val Glu Asp Thr Glu Ala Val 

95 , 245 250 255 

96 Ser Ala Val Gin Gin Leu Leu Asp Asp Glu Arg Met Leu Val Glu Pro 

97 260 265 270 

98 Ala Cys Gly Ala Ala Ala lie Tyr Ser Gly Leu Leu Arg Arg Leu Gin 

99 275 280 285 

100 Ala Glu Gly Cys Leu Pro Pro Ser Leu Thr Ser Val Val Val He Val * 

101 290 295 "300 

102 Cys Gly Gly Asn Asn He Asn Ser Arg Glu Leu Gin Ala Leu Lys Thr 
103, 305 310 315 320 

104 His Leu .Gly Gin Val 

105 325 

107 (2) INFORMATION FOR SEQ ID NO : 2 : 

109 (i) SEQUENCE CHARACTERISTICS: 

110 (A) LENGTH: 1485 base pairs 

111 (B) TYPE: nucleic acid 

112 (C) STRANDEDNESS : single 

113 (D) TOPOLOGY: linear 

115 (vii) IMMEDIATE SOURCE:.. 

116 (A) LIBRARY: THP1AZS08 

117 (B). CLONE: 2752518 
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Input Set : N:\Crf3\RULE60\09925140.txt 
Output Set: N:\CRF3\12052001\I925I40.raw 



1 1 Q 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 








"191 
J. ^ J. 






.ATTAGGATGa TGCATGAGTG 


ATPGGPAGTG 


PPPGGGAAAG 




19 9 


CGGTGAGGGT 


TGCTCTPATC 

X W X \mr X ^-^il X N— ' 


PPPTPTPPTP 


PTPPGTPTTP 


APPPGGAGGP 


TTAGGGTPTG 


12 0 


1 9 ^ 
J. ^ J 


X 


TTTAACAAAG 

X X X ii-Xi Ai-iVtl NJ 


GAGGAGGGAP 


PAAGGTTGPP 


GGAAGPTGPP 


TGAAGPTGGA 


180 


1 9A 


V_^f\ VJ VJ \^ \^\J \J X 


TrTTGGAAAG 


AGPTGGTTPP 


PTGGPAGGPT 


pfZArzpppAGG 


AGPTGGGGPP 


24 0 


1 9 R 


r\\^\j \^ X X X 




GGPAGGGAGG 


PTGTPTAPPT 


GGTPTPPAGA 


ATGGAPGGPP 


300 


J. Z 0 




GPATGPPA AG 


CAGGAGCCC^ 


TTPAPGTGGT 

X X \^rl^^>7 X X 


PAPAPPTPTG 


TTGGAGAGPT 
X X yj\:3 t\.\j t\\j \^ X 


J o u 


19 7 
J. Z / 






\J\J\^c\. X Vjj X 


TPTTPPTPA A 

X ^ X X X ^rU^ 


GTGTGL.GA AT* 


GTiGPAPPPPA 


A9n 

*r Z V> 


1 9 P 
J. Z 0 




PA AGATTPGG 


GGPATTGGGP 


TV m m rp p m p p A 
rVX X X ^ X\7^wr\ 


ppap 2i ^cccr^ 


t\i\\J t\c\\3\J\3 1\ X 


Aft n 

*i O U 


1 9 Q 




\J\J A.\J X V7 X V-.V_^ 


TPAGGGGGTA 


ATGPGGGP AT 




rp A TP PTP P T" a 
Xr\XVjVwXVjV_'Xf\ 


RA n 


J. J U 


. GG2Lh.Gr*TGGG 


PAT'TPPTGPP 


APPATPGTGP 


TPPPPGAGAG 




PAGGTGGTGP 


D U U 


1 1 




O Vj Vj Vj \j 


GPPGAGGTTP 


AGPTGAPTGG 


a a apprppTPP 


GaPGAGGPPa 


D O VJ 






PPA AP aPT'T'P 


GCCAAGAGGG 


ACGGCIGGGA 


r*A A 'T'r''T'PPPP 


PPPT"T"T'P a PP 


79 n 


J. J J 


ACGCCCTAAT 


TV m/^ TV TV TV 

ATGGAAAGGC 


CACGCCAGCC 


TGGTGCAGGA 


ppTip a a appA 


PTippipp a PP a 


7ft n 


134 


CCCCACCAGG 


TGCCCTGGTG 


CTGGCAGTTG 


GGGGTGGGGG 


TCTCCTGGCC 


GGGGTGGTGG 


O /I A 

840 


135 


CTGGCCTGCT 


GGAGGTGGGC 


TGGCAGCATG 


TACCCATCAT 


TGCCATGGAG 


ACCCATGGGG 


900 


136 


CACACTGCTT 


CAATGCGGCC 


ATCACAGCCG 


GCAAGCTGGT 


CACACTTCCA 


GACATCACCA 


960 


137 


GTGTGGCCAA 


GAGCCTGGGT 


GCCAAGACGG 


TGGCCGCTCG 


GGCCCTGGAG 


TGCATGCAGG 


1020 


138 


TGTGCAAGAT 


TCACTCTGAA 


GTGGTGGAGG 


ACACCGAGGC 


TGTGAGCGCT 


GTGCAGCAGC 


1080 


139 


TCCTGGATGA 


TGAGCGTATG 


CTGGTGGAGC 


CTGCCTGTGG 


GGCAGCCTTA 


GCAGCCATCT 


1140 


140 


ACTCAGGCCT 


CCTGCGGAGG 


CTCCAGGCCG 


AGGGCTGCCT 


GCCCCCTTCC 


CTGACTTCAG 


1200 


141 


TTGTGGTAAT 


CGTGTGTGGA 


GGCAACAACA 


TCAACAGCCG 


AGAGCTGCAG 


GCTTTGAAAA 


1260 


142 


CCCACCTGGG 


CCAGGTCTGA 


GGGGTCCCAT 


CCTGGCCCCA 


AAGACCCCTG 


AGAGGCCCAT 


1320 


143 


GGACAGTCCT 


GTGTCTGGAT 


GAGGAGGACT 


CAGTGCTGGC 


AGATGGCAGT 


GGAAGCTGCC 


1380 


144 


CTGTGCAACT 


GTGCTGGCTG 


CCTCCTGAAG 


GAAGCCCTCC 


TGGACTGCTT 


CTTTTGGCTC 


1440 


145 


TCCGACAACT 


CCGGCCAATA 


AACACTTTCT 


GAATTGAAAA 


AAAAA 




1485 



DATE: 12/05/2001 
TIME: 13:04:59 
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VERIFICATION SUMMARY DATE: 12/05/2001 

PATENT APPLICATION: US/09/925,140 TIME: 13:05:00 

Input Set : N:\Crf3\RULE60\09925140.txt 
Output Set: N:\CRF3\12052001\I925140.raw 

-L:31 M:220 C: Keyword misspelled. or invalid format, [(A) APPLICATION NUMBER:] 
L:32..M:220 C:^.Keyword misspel^^ format, [(B) FILING DATE:] 
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